Features for Generic Corpus Querying
نویسندگان
چکیده
The availability of large corpora for more and more languages enforces generic querying and standard interfaces. This development is especially relevant in the context of integrated research environments like CLARIN or DARIAH. The paper focuses on several applications and implementation details on the basis of a unified corpus format, a unique POS tag set, and prepared data for word similarities. All described data or applications are already or will be in the near future accessible via well-documented RESTful Web services. The target group are all kinds of interested persons with varying level of experience in programming or corpus query languages.
منابع مشابه
An Investigation of the Generic Features of Research Articles Published in the Bulletin of Iranian Mathematical Society
In light of the understanding that the analysis of the generic features of different academic genres can enhance the ability of non-native members of academic discourse communities to understand, and where appropriate, to produce them, the present study aimed at investigating the dominant generic structure of research articles in mathematics. To start with a relatively narrow focus, a corpus of...
متن کاملExamining the Generic Features of Thesis Acknowledgments: A Case of Iranian MA Graduate Students Majoring in Teaching to Speakers of Other Languages (AZFA) and TEFL
Thesis acknowledgement is a written genre in which MA graduate students offer their gratitude to individuals, who have contributed to the completion of their study. The aim of the current study was to examine the thesis acknowledgements written by Iranian MA students in the field of Persian Language Teaching to Non-Persian Speakers (Amouzeshe Zaban e Farsi be Kharejian, AZFA) and TEFL in terms ...
متن کاملResearch Article Introductions: Sub-disciplinary Variations in Applied Linguistics
The present study aimed to investigate the generic organization of research article introductions in local Iranian and international journals in English for Specific Purposes, English for General Purposes, and Discourse Analysis. Overall, 120 published articles were selected from the established journals representing the above subdisciplines. Each subdiscipline was represented by 20 local and 2...
متن کاملA Cross-Disciplinary Genre Analysis of Rhetorical Features of Research Article Introductions Written by Iranians
The notion of genre has received a great deal of attention both in discourse analytic studies as well as in the field of ESP/EAP course design. The present paper has attempted to use genre analysis to account for the rhetorical features of research article introductions written by Iranian academics in two disciplinary fields of Education and Economics. The corpus comprised 40 research article i...
متن کاملExploring Sub-Disciplinary Variations and Generic Structure of Applied Linguistics Research Article Introductions Using CARS Model
This paper explores sub-disciplinary variations and generic structure of research article introductions (RAIs) within three sub-disciplines of applied linguistics (AL); namely, English for Specific Purposes (ESP), Psycholinguistics, and Sociolinguistics, using Swales’(1990) CARS model. The corpus consisted of 90 RAIs drawn from a wide range of refereed journals in the corresponding sub-discipli...
متن کامل